The ability of deep convolutional neural networks (CNNs) to learn discriminative spectro-temporal patterns makes them well suited to environmental sound classification. However, the relative scarcity of labeled data has impeded the exploitation of this family of high-capacity models. This study has two primary contributions: first, we propose a deep convolutional neural network architecture for environmental sound classification. Second, we propose the use of audio data augmentation for overcoming the problem of data scarcity and explore the influence of different augmentations on the performance of the proposed CNN architecture. Combined with data augmentation, the proposed model produces state-of-the-art results for environmental sound classification. We show that the improved performance stems from the combination of a deep, high-capacity model and an augmented training set: this combination outperforms both the proposed CNN without augmentation and a "shallow" dictionary learning model with augmentation. Finally, we examine the influence of each augmentation on the model's classification accuracy for each class, and observe that the accuracy for each class is influenced differently by each augmentation, suggesting that the performance of the model could be improved further by applying class-conditional data augmentation.
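To make the idea of audio data augmentation concrete, the following is a minimal sketch of two common waveform-level augmentations, additive background noise at a target SNR and time stretching. The function names, the naive interpolation-based stretch, and the synthetic test tone are illustrative assumptions, not the implementation used in this study:

```python
import numpy as np

def add_noise(signal, snr_db, rng=None):
    """Mix white noise into `signal` at the given signal-to-noise ratio (dB)."""
    rng = rng or np.random.default_rng(0)
    noise = rng.standard_normal(len(signal))
    sig_power = np.mean(signal ** 2)
    noise_power = np.mean(noise ** 2)
    # Scale the noise so that sig_power / (scaled noise power) == 10^(snr_db/10).
    scale = np.sqrt(sig_power / (noise_power * 10 ** (snr_db / 10)))
    return signal + scale * noise

def time_stretch(signal, rate):
    """Naive time stretch by linear-interpolation resampling.

    rate > 1 shortens the clip, rate < 1 lengthens it. This also shifts
    pitch; a real pipeline would use a phase vocoder to stretch time
    while preserving pitch.
    """
    old_idx = np.arange(len(signal))
    new_idx = np.arange(0, len(signal), rate)
    return np.interp(new_idx, old_idx, signal)

# Example: augment a 1-second 440 Hz tone sampled at 16 kHz.
sr = 16000
t = np.arange(sr) / sr
clip = np.sin(2 * np.pi * 440 * t)
noisy = add_noise(clip, snr_db=10)       # same length, 10 dB SNR
stretched = time_stretch(clip, rate=1.2)  # ~17% shorter clip
```

Each augmented waveform would then be converted to a spectro-temporal representation (e.g. a log-mel spectrogram) and fed to the CNN as an additional labeled training example, multiplying the effective size of the training set.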